[BE]: Update 12.6 builds to CUDA 12.6.3 #141433
🔗 Helpful Links
🧪 See artifacts and rendered test results at hud.pytorch.org/pr/141433
Note: Links to docs will display an error until the docs builds have been completed.
❌ 6 New Failures, 17 Unrelated Failures
As of commit b61ff06 with merge base b75bb64:
NEW FAILURES - The following jobs have failed:
FLAKY - The following jobs failed but were likely due to flakiness present on trunk:
This comment was automatically generated by Dr. CI and updates every 15 minutes.
Force-pushed from 9e38b70 to 79fbb80.
@atalman I'll probably need this uploaded to the cu126 s3 bucket. Changes are from the https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#cuda-toolkit-major-component-versions list
Force-pushed from 79fbb80 to 3097a63.
Force-pushed from 44ab294 to 1afafa5.
lgtm. Let's wait for a green signal to land.
Regarding the PyPI matrix packages: these should be nvidia-nvjitlink-cu12==12.6.85 and nvidia-cublas-cu12==12.6.4.1 for 12.6.3.
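For illustration, the two pins from this comment could be written down as below. This is only a sketch: the variable and function names are hypothetical, and it prints the pip command rather than running it, since the actual PyTorch build matrix keeps these pins elsewhere.

```shell
# Pins quoted in the review comment for CUDA 12.6.3.
# NVJITLINK_PIN, CUBLAS_PIN and cu126_pip_command are hypothetical names.
NVJITLINK_PIN="nvidia-nvjitlink-cu12==12.6.85"
CUBLAS_PIN="nvidia-cublas-cu12==12.6.4.1"

# Print the install command instead of executing it, so the sketch
# has no side effects on the current environment.
cu126_pip_command() {
  echo "python -m pip install ${NVJITLINK_PIN} ${CUBLAS_PIN}"
}

cu126_pip_command
```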
Force-pushed from 9157523 to b61ff06.
Adding the ciflow/binaries label to test the x86 nightly wheel.
@atalman Looks like we'll need those binaries uploaded to S3
@Skylion007 let me upload this now
Remaining failures appear unrelated.
@pytorchbot merge -i
Merge failed
Reason: 17 jobs have failed, first few of them are: windows-binary-wheel / wheel-py3_9-xpu-test, windows-binary-wheel / wheel-py3_11-xpu-test, windows-binary-wheel / wheel-py3_13-xpu-test, windows-binary-wheel / wheel-py3_10-xpu-test, windows-binary-wheel / wheel-py3_12-xpu-test
Details for Dev Infra team: Raised by workflow job
@pytorchmergebot merge -f "failures are not related"
Merge started
Your change will be merged immediately since you used the force (-f) flag, bypassing any CI checks (ETA: 1-5 minutes). Learn more about merging in the wiki. Questions? Feedback? Please reach out to the PyTorch DevX Team
Update CUDA 12.6 to Update 3 and make cusparse-lt 0.6.3? pytorch#141365
Was going to leave some comments on pytorch#141365, but thought it was just faster to open a PR here.
Pull Request resolved: pytorch#141433
Approved by: https://github.com/atalman
function install_cusparselt_063 {
    # cuSparseLt license: https://docs.nvidia.com/cuda/cusparselt/license.html
    mkdir tmp_cusparselt && pushd tmp_cusparselt
    wget -q https://developer.download.nvidia.com/compute/cusparselt/redist/libcusparse_lt/linux-x86_64/libcusparse_lt-linux-x86_64-0.6.3.2-archive.tar.xz
Missed this: it should have been libcusparse_lt-linux-sbsa-0.6.3.2. The x86_64 archive is causing this error: /usr/local/cuda/lib64/libcusparseLt.so: error adding symbols: file in wrong format
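One way to avoid this kind of mismatch is to derive the archive name from the machine architecture instead of hard-coding it. The following is a rough sketch, not the script from this PR: `cusparselt_archive_url` is a hypothetical helper, and the `linux-sbsa` directory in the URL is assumed by analogy with the `linux-x86_64` URL shown in the diff.

```shell
# Hypothetical helper: print the cuSPARSELt 0.6.3.2 archive URL for an
# architecture. Defaults to the current machine as reported by `uname -m`.
cusparselt_archive_url() {
  local arch="${1:-$(uname -m)}"
  local version="0.6.3.2"
  local arch_path

  case "$arch" in
    x86_64|amd64)  arch_path="x86_64" ;;
    aarch64|arm64) arch_path="sbsa" ;;   # ARM server builds use the sbsa archive
    *)
      echo "unsupported architecture: $arch" >&2
      return 1
      ;;
  esac

  # Assumption: the sbsa archive follows the same URL layout as x86_64.
  local base="https://developer.download.nvidia.com/compute/cusparselt/redist/libcusparse_lt"
  echo "${base}/linux-${arch_path}/libcusparse_lt-linux-${arch_path}-${version}-archive.tar.xz"
}
```

Under these assumptions, `cusparselt_archive_url aarch64` prints the `libcusparse_lt-linux-sbsa-0.6.3.2-archive.tar.xz` URL, and an unrecognized architecture returns a non-zero status instead of silently downloading the wrong binary.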
Update CUDA 12.6 to Update 3 and make cusparse-lt 0.6.3? #141365
Was going to leave some comments on #141365, but thought it was just faster to open a PR here.
related to #138440